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Abstract. I present in this paper a method to calibrate data obtained from optical and infrared interferometers. 
I show that correlated noises and errors need to be taken into account for a very good estimate of individual 
error bars but also when model fitting the data to derive meaningful model parameters whose accuracies are not 
overestimated. It is also shown that under conditions of high correlated noise, faint structures of the source can 
be detected. This point is important to define strategies of calibration for difficult programs such as exoplanet 
detection. The limits of validity of the assumptions on the noise statistics are discussed. 
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1. Introduction 

With optical-infrared interferometry becoming more ma- 
ture, the quality of visibility measurements have become 
an issue. Single-mode interferometers (see Sect. 2.3) allow 
one to eliminate non-stationary effects by filtering out the 
spatial modes of turbulence. The response of interferom- 
eters is therefore very stable and the issue of estimating 
the accuracies of non-biased data is raised. The final visi- 
bility estimate is a complex quantity as it is a non-linear 
mix of noisy measurements and of parameter estimates 
with their own uncertainties. Estimating the stability of 
the instrument, a crucial point for calibration, and the fi- 
nal error on visibilities is therefore non-trivial and must be 
considered with caution. Moreover, data analysis mainly 
consists of model fitting the final visibilities and the mat- 
ter of their potential correlations becomes important, es- 
pecially if some very faint structures are looked for, as is 
the case in extra-solar planet detection. 
In this paper I propose a method to meet these challenges. 
The method has been tested and elaborated along with the 
FLUOR interferometer, the first single-mode interferome- 
ter. This method was first published in |Perrin (1996)| and 



used in Perrin et al. (1998) It is updated and improved 



in this paper by accounting for correlations. 
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2. Principles of data reduction and calibration 

In this section the general scheme of data reduction is 
reviewed to introduce the vocabulary and notations. Two 
main steps are to be considered. In the first one (Sect. 2.1), 
the fringe processing, fringe contrasts are derived from raw 
signals. Because of contrast losses, fringe contrasts are cal- 
ibrated in a second step (Sect. 2.2) to provide the visibil- 
ities directly linked to the spatial intensity distribution of 
the source. 



2.1. Fringe contrasts estimates 

In the following we will distinguish between the fringe con- 
trast obtained on a source and the visibility of this source. 
The fringe contrast measured from a single exposure or 
scan is called the coherence factor and is noted \i whereas 
the visibility is noted V. 

Whatever the beamcombining technique, fi being the 
modulus of a complex number, unbiased estimators are 
only obtained for squared quantities from wich biases due 
to additive noise can be subtracted. In the future, phase 
referencing techniques may allow one to directly measure 
complex visibilities (real and imaginary parts) but this is 
not the case yet and I will only consider measurements of 
fringe contrast moduli. The result of the processing of a 
series of scans on a single source is a series of realizations 
of the /x 2 estimator or is an average value of the realiza- 
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Fig. 1. Examples of squared coherence factor histograms obtained with FLUOR in one of the interferometric channels. 
About 100 intcrfcrograms have been recorded for each object. The mean and rms of individual measurements are given 
for this channel. The correlation factor r measures the noise correlation between the two interferometric channels. The 
amount of atmospheric piston is decreasing from the left to the right. 



tions with a la error bar if their statistical distribution 
can be trusted to be Gaussian. 

2.2. Necessity for a calibration 

The average ^ 2 is not directly an estimator of the squared 
modulus of the visibility of the source because some physi- 
cal phenomena degrade the coherence factor. Among these 
phenomena, polarization mismatches between the inter- 
ferometric beams are the most common. Without perfect 
beam cleaning by a fiber, atmospheric turbulence also de- 
grades the fringe contrast. It is necessary to estimate the 
loss of coherence on a calibrator source for which the visi- 
bility is known. A transfer function T is obtained by com- 
puting the ratio of the measured coherence factor \i c to 
the expected visibility V exp . With squared quantities this 
yields: 



V 2 ' 

r exp 



(i) 



If the instrument is stable enough then the estimate of 
T 2 obtained on the calibrator can be used to derive an 
estimated visibility for the source from the measured co- 
herence factor: 



V 2 = ^ = coT 2 ^. 



(2) 



a perfect single-mode interferometer in which single-mode 
fibers are used to filter the phase aberrations produced by 
atmospheric turbulence (except for the piston mode) these 
non-stationary effects are eliminated. In order to avoid in- 
stabilities due to the piston mode, the fringes are scanned 
at a frequency far above the characteristic frequency of 
piston. In interferometers where the piston is stabilized 
by a fringe tracking servo loop, this issue is solved. The 
remaining main sources of variation of the transfer func- 
tion are basically temperature drifts and differential po- 
larisation effects due to the change of beam inclination 
on the first mirrors with changing positions of the sources 
in the sky. In both cases the transfer function drifts are 
very slow and a good estimate of the transfer function can 
be obtained by interpolating two estimates bracketing the 
source to be calibrated. This has been demonstrated with 
the FLUOR beamcombincr, as will be shown is Sect. 7.2. 
In the following we will therefore consider that the effi- 
ciency of the interferometer is continuously assessed by 
observing calibrators before and after science sources. We 
will not consider the case where the transfer function is 
derived by averaging individual transfer functions on a 
large temporal scale, as this is not required for a single- 
mode interferometer. This technique does not allow one 
to assess the quality of the calibration in detail. 



where I call coT 2 the squared co-transfer function. Its use 
will be detailed in Sect. 4. 



2.3. Assumptions in the case of a single-mode 
interferometer 

The calibration process may fail if the assumption that 
the transfer function is stable is wrong. This usually hap- 
pens if non-stationary processes like atmospheric phase 
turbulence play a role in the fringe formation process. In 



3. Estimating fringe contrasts 

This section focuses on estimating the statistical prop- 
erties of fringe contrasts. I will not describe the method 
to compute coherence factors from single exposures and 
I will refer the reader to appropriate articles in the next 
paragraphs. 
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Fig. 2. Examples of squared co-transfer functions measured with FLUOR. The two curves for each night correspond 
to the two interferometric channels of the coaxial interferometer. The full circles are the squared co-transfer functions 
measured on calibrators whereas the open circles are the values interpolated at the time when the science targets were 
observed. 1 a error bars are displayed. 



3.1. Single-channel spatial modulation interferometer 

In a multiaxial interferometer, distant parallel beams feed 
a focusing optic. The beams are recombined in the focal 
plane where they overlap at the focus locus. The modula- 
tion is spatial as the fringe phase varies across the diffrac- 
tion pattern. A method to derive fringe contrasts has been 



published by Mourard et a l. (1994) in the case of GI2T. 
The method has been adapted to AMBER which is a 
single-mode multiaxial interferometer (Chelli et al. 2000). 
Thanks to the filtering of the non-stationary modes of tur- 
bulence, the statistics of /it 2 can be well approximated by 
a Gaussian distribution. This will be demonstrated in the 
case of the data obtained with FLUOR in Sect. 7.1. The 
estimate of the squared coherence factor is therefore the 
mean of the distribution of the realizations denoted /i 2 . 
An unbiased estimate of the variance of individual mea- 
surements is: 



S 2 = 



N-l 

-E 

n=0 



N- 1 



04 -m 2 )- 



(3) 



The estimate of the variance of the coherence factor esti- 
mator jj? is then: 



(4) 



3.2. Two-channel temporal modulation interferometer 

In a coaxial interferometer, beams are superimposed in po- 
sition and in direction. This can be realized with a beam- 
splitter or with a fiber coupler. A relative phase between 
the beams is introduced by setting an optical path differ- 
ence. This is achieved with a moving mirror in one of the 
two beams, hence the temporal modulation of the phase. A 
method to compute fringe contrasts for this type of inter- 



ferometer is described in Coude du Foresto et al. (1997) 



A more recent method based on wavelets analysis has been 
proposed by |Segransan et al. (19997] The method to ob- 
tain an estimate of the coherence factor without the pho- 
ton noise bias is explained in Perrin (2002) A prototype 



instrument for this kind of interferometer is the FLUOR 
beamcombiner . 

The difference with the previous interferometer of Sect. 3.1 
is that it produces two interferometric beams and there- 
fore two sets of coherence factors estimates. The statistics 
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10 20 30 40 50 60 

Spatial frequency (cycles /arcsec) 

Fig. 3. Example of visibility fit. The source is SW Vir and the model is a uniform disk. Errors are 1 a errors. Open 
circles and dashed line are the visibilities and model fit computed without taking correlations into account. Full circles 
and continuous line are the equivalent with the method described in this paper. The two cases are separately presented 
in the little windows. 



of each set can be well approximated by a gaussian statis- 
tics as will be shown in Sect.0 The photon noises of the 
two interferometric signals are uncorrelated. The read-out 
noises are generally considered uncorrelated but some cor- 
relation may occur as different pixels share the same read- 
out electronics. In addition the two beams suffer from the 
same turbulence effects (residual piston and photometric 
beam fluctuations) which generate some noise in the mea- 
surements. Part of the noise is therefore common to the 
two signals and the coherence factors estimates are corre- 
lated. The correlation factor r is directly estimated from 
the ji 2 distributions: 



y/Var&tiVar&j)' 



(5) 



4. Estimating the transfer function 

It is assumed that the transfer function is a slowly varying 
function which is rapidly sampled. This property will be 
illustrated with real data in Sect. [3 It is then legitimate 
to linearly interpolate the squared transfer function at the 
time when the science source was observed. Because the 
variances of products of random variables are more eas- 
ily calculated than those of ratios, the reciprocal of the 
squared transfer function, the squared co-transfer func- 
tion, is interpolated instead of the squared transfer func- 
tion. The use of one or the other is equivalent. In order to 
be general, two interferometric outputs are always consid- 
ered. The particular case of the multiaxial interferometer 
will be considered in discussions. The expression of the in- 
terpolated co-transfer functions in the two outputs of the 
instrument is : 



where the subscripts describe the two interferometric 
channels. 



COT 2 =*J£ + (l- a ;)£ r 



Mi 



coT? 



m 2 m 2 



(6) 
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Fig. 4. Examples of fits of visibility data with a uniform disk model. All correlations are taken into account. Errors 
are 1 a errors. The visibility point around 85 cycles/arcsec in the G Her panel is not taken into account in the fit. 



V a and V b are the expected visibilities of calibrators A 
and B. Coefficient x is the relative time distance between 
the observation of the science target and the observation of 
calibrator A. The expected visibilities are supposed to be 
Gaussian random variables. Dropping the channel indices, 
the variances of the squared co-transfer functions are equal 
to: 

Var(coT 2 ) = x 2 Var{coT a2 ) + (1 - x) 2 Var(coT b2 ) (7) 

when the two calibrators are different. When the two cal- 
ibrators are the same then the variance is equal to: 

Var(coT 2 ) = x 2 Var(coT a2 ) + (1 - x) 2 Var{coT b2 ) (8) 
+2 X ^Z^ Var(V a2 ) 

The squared co-transfer functions estimated on the cali- 
brators are ratios of Gaussian estimators. These new ran- 
dom distributions arc not Gaussian. They are Cauchy 
distributions (the density probability of which is a 
Lorentzian) with no mean and no variance. By analogy 
with the standard deviation of a Gaussian law, an esti- 
mate of the uncertainty can be derived from the width of 



the confidence interval. The upper and lower limits of the 
confidence interval at 68.3% for a ratio of two Gaussian 
random variables a and (3 are given by ( |Pelat (19 92) ): 

13 t ((«/».)'+'</^)'-l)" , a " 

This interval is not symmetric. I choose as error bar the 
larger distance of the mean to the limits, thus slightly 
overestimating the error. The probability that the true 
value is in this interval around the mean is therefore larger 
than 68.3%. In the following I will consider that ratios 
of Gaussian variables are Gaussian variables. This is not 
rigourously true but this allows one to derive expressions 
otherwise difficult to handle. The Gaussian and Lorentzian 
functions are both bell curves, the wings of the latter be- 
ing more extended than that of the Gaussian. The ap- 
proximation amounts to giving more weight to the center 
of the lorentzian. The other possibility would be to apply 
Monte-Carlo or bootstrap methods to all error bar com- 
putations, which would make data reduction a very long 
process for a very limited gain. Nevertheless, this method 
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will have to be applied once to compute the correlation be- 
tween the visibilities of a two-channel coaxial interferom- 
eter (Sect. EJ. Consistency of error bars will be addressed 
in Sect. This final consistency is the justification of the 
approximations performed. 

5. Estimating visibilities 

5.1. Mean and variance of channel visibilities 

The single-channel squared visibility in the case of a mul- 
tiaxial interferometer or the two-channel squared visibili- 
ties of a coaxial interferometer are simply the product of 
the science target squared coherence factors and squared 
co-transfer functions of Eq. © yielding: 



V? = xJ±=V a2 + (1 - x)J^V b2 

v? = x£=v a2 + (i - x )£=v b2 



(10) 



The variances of the two-channel squared visibilities are 
calculated with the variances of the squared co-transfer 
functions and of the squared coherence factors with the 
following formula : 

Var{AB) = Var(A)Var{B) + A 2 Var{B) (11) 
+B 2 Var(A) 

We therefore have: 

Var(V?)= VarQ4)Var(coT?) +]4Var(coT?) (12) 

+coT 2 Var(J4) 
Var{v£) = Var(/4)Var{coTi) +]4Var(coTi) 

+coT 2 Var(j4) 

5.2. Correlation between two channel visibilities 

The above equations define the uncertainties on the chan- 
nel estimates of the visibilities. In the case of a multiaxial 
interferometer, this is the final estimate of the visibility. 
In the case of coaxial interferometers, the two estimates of 
the visibility need to be averaged at this stage. For this, 
it is necessary to assess the correlation factor between the 
two estimates. By definition, the correlation factor is equal 
to: 



P12 



vi)) 



y/Var(V?)Var(V£) 



(13) 



This quantity is defined by sums, ratios and products of 
ten random variables. The correlation factor has to be 
computed with a Monte-Carlo method by simulating each 
random variable from its mean and variance (assuming 
it has Gaussian statistics) and by correlating the series 
of V 2 and V 2 . In the special case when the correlations 
between measured quantities can be neglected because the 
correlated noise level is far below that of the uncorrelated 
noise, it is to be noticed that there is still a correlation 
due to the common estimated values of the calibrators 



expected visibilities in the two interferometric channels. 
This can be illustrated by the equation below when the 
first and second calibrators are different: 



P12 = 



x iJ±J±Var{V a2 ) + (1 - x) 2 4t4? Var ( Vb2 ) 

Mi ^2 



fi 2 14* 



yJVar(V?)Var(V?) 



(14) 



When the two calibrators are the same the correlation fac- 
tor has the particular expression (for sake of simplicity the 
two estimates of the visibility at slighlty different baselines 
are supposed to be the same): 



Pi2 = - 



*#f + (l-*)% 



*=ff + (l-z)=§f 



Var(V a2 ) 



yJVar{V?)Var{V£ 



-(15) 



It is now easy to see from Eq. (f 101) that if the noise on the 
measurements is negligible with respect to the uncertain- 
ties on the expected visibilities of the calibrators then the 
correlation tends towards 1. In this case, the second in- 
terferometric channel brings no extra information except 
a consistency check and the precision on the visibility of 
the science target is directly proportional to the precision 
on the expected visibilities of the calibrators. 

5.3. Comments on visibility variances 

It is also interesting to analyze the propagation of noises 
in the visibility estimates. For example, if the noise on the 
measurements is negligible, it is possible to evaluate the 
amount of variance due to the uncertainty on the calibra- 
tors visibilities (or diameters). Dropping channel indices I 
obtain: 



Var{V 2 )\ 



+ (l-x) 2 



Var(V a2 ) 



(16) 



Var{V b ) 



If the uncertainties on the calibrators ' squared visibilities 
are equal to 1% and if the coherence factors are all equal 
to 1 and observations are equally spaced in time then the 
uncertainty on the measured squared visibility is equal 
to 0.7%. This equation also shows that the smaller the 
visibility of the calibrator, the more amplified the noise 
on its expected visibility is. Symmetrically, the smaller the 
visibility of the science target, the smaller the contribution 
to the noise of the calibrators' expected visibilities. 

5.4. Final estimate of the visibility in a two-channel 
interferometer 

The final squared visibility V 2 is estimated from the two 
squared visibilities obtained from each output of the in- 
terferometer V 2 and V 2 2 and their respective variances (or 
equivalently uncertainties <r\ and o^)- I define the final 
estimate V 2 as being the least squares fit estimator of 
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the squared visibility as this is an optimal estimator for 
Gaussian random variables. In this fit, the model is lin- 
ear and has only one parameter: V 2 . Let us call C the 
covariance matrix of V 2 and V 2 2 : 



C 



07 



Pl2C r l0'2 



<J 2 



(17) 



The quantity to minimize in the least squares fit can then 
be written: 



S(V 2 



V 2 - V 2 
V 2 - V 2 



C" 



V 2 - V 2 
V 2 - V 2 



r*C _1 F. (18) 



It can be shown that the minimum is reached for: 



v 2 = (x^^x)-^^- 1 

with 

X = 



v 2 



the uncertainty on V 2 being: 

= (x t c- 1 x)- 1 x t c- 1 . 



2 

a V 2 



(19) 



(20) 



(21) 



The above equations yield the final visibility estimate: 



o{oi, - 2pi2cricr2 



and the associated error: 



'v 2 



(1 - A^ajaj 

°1 + °2 ~ 2pi 2 (7i(72 



(23) 



If P12 = 1 then the two single-output squared visibilities 
V 2 and V 2 are fully correlated and the above expression 
does not apply. In this case V is equal to one of the two 
single-output visibilities with its associated error bar. 
The quality of the fit is expressed by the \ 2 '■ 



2 >2 



X 



(V 2 V 2 ) 



4 



2/012(71(72 



(24) 



This parameter is important because it allows us to check 
the consistency of the instrument and of the method to 
measure the visibilities and the error bars. If all assump- 
tions are correct then the % 2 should be equal to 1 on aver- 
age. In the FLUOR software we use this number as a data 
quality parameter. Data with x 2 greater than 3 should 
be examined in detail and rejected for science programs 
requiring a very good quality of calibration as the proba- 
bility to get a value larger than 3 is only of 8.33%. 

5.5. Correlations of multiple baseline interferometer 
simultaneous visibilities 

Coherence factors recorded simultaneously on different 
baselines with telescopes in common may also be corre- 
lated. This correlation should be taken into account and 
saved with the reduced data in the form of a correlation 



matrix. The correlations may be as high as the correla- 
tions between the two channels of a coaxial interferometer 
as all calibrators are common to all baselines. The method 
used in Sect. 15.21 should be applied. A correlation matrix 
for the p 2 should be computed first. The final correlation 
factors for the final visibility estimates are then computed 
with a Monte-Carlo method. 

6. Correlations between non-simultaneous 
visibilities 

Visibilities obtained on different baselines or on different 
days are usually considered independent. In the last para- 
graph, we focused on the possible correlations of visibili- 
ties recorded simultaneously on baselines with telescopes 
in common. In this section, we will consider the correla- 
tion due to common uncertainties in the calibration pro- 
cess for independent baselines or for visibilities measured 
at different times. The calibration of the transfer function 
may have required us to use the same calibrators hence the 
same diameter estimates. The errors on the visibilities are 
therefore not independent. It is the purpose of this para- 
graph to establish a method to compute this correlation 
and, more important, to be able to trace it to compute it 
a posteriori long after taking the data at the telescopes. 
Let Si and S 2 be two spatial frequencies at which squared 
visibilities V^ 2 (5i) and V 2 (S 2 ) have been measured. The 
visibility estimates of Eqs. (|10fl and (|22|l can take the form: 



V 2 {S 1 ) = aiU a2 (Si)+/3il/ b >i) 
V 2 {S 2 ) = a 2 V c2 {S 2 )+(3 2 V d2 (S 2 ) 



(25) 



where the calibrators are A, B, C and D. To save room, the 
calibrator visibilities are replaced by the capital letters. It 
can be shown that the correlation factor between the two 
squared visibilities is: 



p(V 2 ( Sl ),V 2 (S 2 )) = ^W*r(A)Var(B2p(AQ 
^Var^iS^VariV 2 ^)) 
a 1 p 2 ^/Var{A)Var(D)p(A, D) 
X ^JVar^iS^VariV 2 ^)) 
Pia2>JVar(B)Var{C)p(B,C) 
^/Var^iSiWariV 2 ^)) 
(3iP 2 ^Var(B)Var{D)p{B, D) 
v /yar(F 2 (5i))Uar(F 2 (5 2 )) 

When all calibrators are different, all correlations are zero. 
The correlation is maximum when a single calibrator has 
systematically been used. Measurement noise is present at 
the denominator only and the correlation is of course all 
the larger as the measurement noise is smaller. 
The correlation between two expected squared visibilities 
at two different baselines is not easy to evaluate analyti- 
cally. Besides, it may depend upon the model of the cali- 
brator. A computation can be performed which shows that 
the correlation is indeed equal to 1 with an excellent ac- 
curacy as long as no baseline is equal to 0. This can also 
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be shown by expanding the visibility function. Thus, the 
expected visibilities derived from a uniform disk model of 
a same calibrator at two different baselines are fully cor- 
related to the first order. This holds as long as the second 
derivative of the model is small (which in the case of the 
uniform disk model is true except close to the zeros of the 
model) and as a condition, none of the baselines is very 
close to zero. In practice, the error on the diameter being 
usually small (less than 5%), the first order approxima- 
tion is valid and the two expected squared visibilities can 
therefore be considered fully correlated. This is true down 
to very short baselines as for example for a diameter of 
10 ± 0.5 mas the correlation starts to decrease for a base- 
line below 5 cm. 

For practical use, Eq. (|26|1 can be simplified as the corre- 
lations between expected visibilities are either or 1 when 
the calibrators are respectively different or alike. The only 
requirement to compute this correlation is therefore that 
the variances of the expected visibilities and the coeffi- 
cients a and (3 be saved with the reduced data. These cor- 
relations will have to be computed to model fit the data. 
The generalization of the Levenberg-Marquardt method 
with correlated data is given in the Appendix at the end 
of the paper. 

7. Validation of the method 

Examples of data reduction results and calibrations are 
presented. The quantities introduced in the previous sec- 
tions are discussed in practical situations and general com- 
ments on observing strategies are expressed. 

7.1. Squared coherence factors statistics 

I have plotted in Fig^three examples of (x 2 distributions. 
In the case of V636 Her, the fringe speed puts the fringe 
frequency far above the turbulence piston spectrum. The 
piston is almost frozen during each scan and the amount of 
correlated noise is small. In the case of 71 UMa, the fringe 
speed is lower and the measurements are more sensitive 
to piston hence the higher correlated noise. 5 Sge is an 
intermediate case. In all three examples, the distributions 
of li 2 are compatible with Gaussian distributions hence 
validating the basic assumption on the statistics of the li 2 . 
An important fact is that the amount of correlated noise 
is not negligible and must be taken into account. However, 
a test on distributions is performed to detect deviations 
from Gaussian statistics. Deviations are not common and 
are always due to instrumental problems. In such cases, 
depending on the required level of data quality, data may 
be eliminated. 

7.2. Examples of transfer functions 

Figure |3 presents two examples of squared co-transfer 
functions. Full circles are measurements on calibrators 
whereas open circles are interpolations for science targets. 
It is visible that the co-transfer function is not always sta- 



ble and may experience variations. In some cases like on 
May 15, 2000 at 8:07, an error of calibration may have 
happened as the co-transfer functions jump by a few per- 
cent. Yet, in most cases, the transfer functions variations 
are slow on time scales of a few hours and variations can 
be well approximated to the first order. Data collected on 
May 22, 2000 show that this is still the case when the cal- 
ibrator diameters are known with a very good precision. 

7.3. Discussion of model fitting and examples 
7.3.1. Amount of correlation 

Before presenting examples let us summarize the different 
levels of correlations we have encountered so far: 

1. correlation of coherence factors (coaxial beamcombin- 
ers) 

2. correlation of interferometric channels (coaxial beam- 
combiners) 

3. correlation of simultaneous baselines 

4. correlation of non-simultaneous baselines 

The first level (r) was addressed in Sect. I7.ll The amount 
of correlation between interferometric channels (/O12) for 
a coaxial beamcombiner like FLUOR varies from a few 
percent for faint sources calibrated by very well-known 
calibrators to almost 100% for bright sources calibrated 
by sources whose diameters are known with an accuracy 
of a few percent. The two channels are therefore not fully 
independent in this case and it is important to check the 
X 2 defined by Eq. (|24|l . A large x 2 may indicate that ei- 
ther the assumptions on Gaussian statistics were wrong for 
these particular data or that the transfer function varia- 
tion is not well measured. In either case, data should be 
examined in detail to decide whether the visibility value 
can be used or not. A blind method is to reject visibilities 
with a x 2 above a certain level that can be of 3 for difficult 
programs or relaxed to a larger value for easier programs. 
It is important to note that if the transfer function has 
varied accordingly in both channels at the time the sci- 
ence target was observed by an amount larger than the 
error bars then this x 2 test will fail to detect it. It can 
only be detected if the variations are opposite in the two 
channels. This is certainly a weakness. 
It will be interesting to assess the level of correlations of 
visibilities measured with multiple beam interferometers. 
It can be anticipated that it will not be negligible and will 
be of the same level as pi2- 

The importance of correlation between visibilities 
recorded separately is illustrated in Fig. EI The data have 
been reduced in two different ways. Data plotted with 
open circles and fitted by a dashed-line uniform disk model 
are reduced without taking correlations into account. Data 
plotted with full circles and fitted by the continuous line 
were reduced with the method of this paper. In the first 
case, the fit is of very good quality with a x 2 smaller than 
1. Yet, all visibilities have been calibrated with the same 
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source, hence a strong correlation between visibility values 
as the 3x3 correlation matrix shows: 



C 



1 0.96 0.96 
0.96 1 0.97 
0.96 0.97 1 



(27) 



It is to be noticed that the p\2 correlation factor is larger 
than 90% for all three visibilities, a large fraction of this 
correlation being due to the common calibrator. If corre- 
lations are ignored then noise is considered independent 
from one visibility to the other and this is why the first x 2 
is smaller, as a large global noise is now interpreted as a 
large fluctuating noise from one visibility to the other. On 
the contrary, when correlations are used, a tiny fraction 
of noise (4% at most) can be considered a fluctuation giv- 
ing degrees of freedom for the adjustement of the model. 
This is equivalent to reducing error bars on visibilities by 
96% in the fit. The common noise due to the uncertainty 
on the calibrator is then a simple common bias on the 
visibilities but does not contribute to the noise in the fit, 
hence the much larger \ 2 ■ In the zoomed part of this same 
figure, one can see that the fit now conforms to only one 
of the visibility data as the correlation matrix is close to 
being non-invertible (see the Appendix for the use of the 
correlation matrix in the fitting process). In more physical 
terms, the correlations being very close to one, all data are 
equivalent and the fit can be derived from one of the visi- 
bility data. If all data points were compatible despite the 
large correlations then the best fit curve would go through 
the error bars. It is not the case here and this is why the 
X 2 is large. 

7.3.2. Examples of visibility accuracies 

Other examples of model fitting are presented in Fig. 0] 
The BK Vir data were calibrated with the same calibrator 
(the correlation matrix is similar to the matrix above) as 
SW Vir but the visibilities are very consistent with each 
others. Data for the three other sources are either totally 
independent or slightly correlated. Only the first lobe data 
were used for the fit of G Her. These four examples show 
very good fits and consistency of data. In particular, this 
sets the best absolute accuracy of the calibration of visi- 
bilities with FLUOR to 0.004 (equivalent to an accuracy 
of 0.004 on V 2 with V = 0.5 as a v i = 2a v .V to the first 
order). 

7.3.3. Calibration strategies 

It is important to adapt the strategy of calibration to the 
type of astrophysical studies addressed with optical inter- 
ferometers. For most studies where visibility accuracies of 
a few percent are acceptable, the repeated use of a single 
or of a few calibrators is possible. For difficult programs 
like exoplanet detection, a very high level of accuracy is 
required and the strategy needs to be well prepared. Two 
cases may arise depending on whether the required cal- 
ibration of visibilities is absolute or relative. If absolute 



accuracies better than 0.001 have to be obtained on visi- 
bilities then it is very likely that no calibrator can be used 
twice, unless the error on the expected visibility of this 
calibrator is less than the level of accuracy required. This 
would suppose that the visibility model of the calibrator 
be measured first. Another possibility is relative detection. 
As illustrated by the example of SW Vir, if the same cal- 
ibrator is systematically used, the fit is sensitive to very 
low levels as the correlated noise does not contribute to 
the value of the \ 2 . In this example, a departure from the 
uniform disk model or a calibration error may have been 
detected to a level much lower than the error bars. For 
very faint detail detection, this can work if the visibility 
curve of the calibrator is smooth and without wiggles of 
similar amplitude as the ones searched for on the science 
target. 

In any case, the observing strategy should be prepared in 
advance and should take the problem of data correlations 
into account. 



8. Conclusion 

I have proposed in this paper a method to calibrate visibil- 
ity data obtained with single mode interferometers. The 
single mode character is required to make valid the as- 
sumption that the statistics of coherence factors data are 
Gaussian and stationary. It is possible to derive reliable er- 
ror bars if all correlations are considered in the derivation 
of all estimators. Correlations also need to be taken into 
account when fitting the data by models. The validity of 
the method has been demonstrated on real interferomet- 
ric data recorded with FLUOR. An important conclusion 
of this work is that the strategy of calibration has to be 
adapted for specific programs requiring high standards of 
calibration. 



References 

Chelli, A., 2000, AMBER report AMB-IGR-017 

Coude du Foresto, V., Ridgway, S.T., Mariotti, J.-M., 1997, 

A&ASS, 121, 379 
Mourard, D., Tallon-Bosc, I., Rigal, F., et al. 1994, A&A 288, 

675 

Pelat, D., 1992, Cours "Bruits et Signaux", Ecole Doctorale 

d'lle de France, Astronomie-Astrophysique 
Perrin, G., 1996, PhD thesis, Universite Paris VII 
Perrin, G., Coude du Foresto, V. Ridgway, S.T., et al., 1998, 

A&A 331, 619 
Perrin, G., 2002, A&A, in press 

Press, W.H., Flannery, B.P, Teukolsky, S.A., Vetterling, W.T., 
1988, "Numerical Recipes in C", Cambridge University 
Press 

Segransan, D., Forveille, T., Millan-Gabet, R., Perrier, C, 
Traub, W.A., 1999, "Working on the Fringe: Optical and IR 
Interferometry from Ground and Space", ASP Conference 
Vol. 194, p. 290, S. Unwin and R. Stachnik Eds. 



10 



G. Perrin: The calibration of interferometric visibilities obtained with single-mode optical interferometers 



Appendix A: Practical implementation of model 
fitting with correlated visibilities 

Algorithms for model fitting are well known. One of the 
most commonly used is the Levenberg-Marquardt algo- 
rithms. An example of practical implementation is given 
in Press et al. (1988) in the case where data are not cor- 



related. Here, I give a generalization of this algorithm to 
the case of correlated data. 



A.l. Definitions 

Let (V^)i—i t .,. t jsr be a series of squared visibility measure- 
ments of an astronomical source. Let C be the matrix of 
variances-covariances of these measurements: 



C = 



0\ Pl,2&l<72 
Pl,2 a l a 2 &2 



'N 



(A.l) 



Let (Mj(^))j = i ! ... ! jv be the model to fit the data with 9 a 
vector of p parameters. It is assumed that the noise on the 
squared visibilities is Gaussian. 

A. 2. Method 

The best estimates of the parameters in the sense of the 
highest likelihood are obtained by maximizing the likeli- 
hood function of the Y = [V 2 — M(9)] vector which leads 
to minimizing the following functional: 



S(0) 



1 



The optimum 9 is obtained when: 

88 U 

^4 > o 

Besides, the covariance matrix of the estimated parame- 
ters is asymptotically (N — ► +oo) equal to: 



(A.2) 



(A.3) 



Var{9) 



'o 2 s 




09 2 


8=8- 



(A.4) 



The search for the optimum can be performed with a gen- 
eralized Levenberg-Marquardt algorithm. 

A.3. Generalized Levenberg-Marquardt algorithm 

Close to minimum, S can be expanded to the second order 
in 9: 



S(9) = S(9)- 



~dS~ 


t 


9-9 


1 


9-6 


t 


'd 2 s~ 




9-9 


80 _ 






+ 2 






d9 2 







d 2 S 
d9 k d9. 



dM 



d6 k 
kj = 



C 



_ 1 dM 
09, 



d 2 M 
d6 k d9. 



C _1 [V 2 - M] 



1. 



,P 



In case S is equal to its second order expansion then 9 can 
be guessed directly through: 



(A.7) 



If the expansion is a poor approximation of S then a steep- 
est descent method has to be applied with: 



~d 2 S~ 


-i 


'dS~ 


89 2 _ 




36 



6 — constant x 



OS 



09 



(A, 



This the basis of the Levenberg-Marquardt method. It is 
classical to use the following notations: 



0k = -^7T 



Olkl = 



OS 
d6 k 
dM' 



k = l,...,p 

_ x dM 



(A.9) 
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89, 



k,l = 1,. 



,p 



where §M- is the k th column of the matrix: 

Offc 



dM 



dM(xi,0) 
o%. 



(A.10) 



The matrix equivalent of the above definition is therefore: 



dM 



d9 
dM 



09 



cr 1 [v 2 - M] 

_ x dM 



(A.ll) 



c 



09 



At optimum the variance-covariance matrix of the param- 
eters is: 



Var(9) 



(A.12) 



With these definitions, 2S(9) is a \ 2 with N — p degrees 
of freedom. The increment in parameter space is therefore 
the solution of the linear system: 



p 

E 



olmS6i = f3 k 



(A.13) 



when the quadratic form is a good approximation to S 
otherwise the increment is given by: 



= constant x /3; 



(A.14) 



The reader is referred to Press et al. (1988) for the imple- 
mentation of this modified algorithm. 



(A.5) 



The first and second derivative of S are respectively a vec- 
tor and a matrix whose elements are: 



OS 
06~ k 



OM 
OK 



C- 1 [V 2 - M] ,k = l,...,p (A.6) 



